NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Metric-Agnostic Continual Learning for Sustainable Group Fairness

https://doi.org/10.1609/aaai.v39i18.34052

Lian, Heng; Zhao, Chen; Chen, Zhong; Zhu, Xingquan; Thai, My T; He, Yi (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Group Fairness-aware Continual Learning (GFCL) aims to eradicate discriminatory predictions against certain demographic groups in a sequence of diverse learning tasks.This paper explores an even more challenging GFCL problem – how to sustain a fair classifier across a sequence of tasks with covariate shifts and unlabeled data. We propose the MacFRL solution, with its key idea to optimizethe sequence of learning tasks. We hypothesize that high-confident learning can be enabled in the optimized task sequence, where the classifier learns from a set of prioritized tasks to glean knowledge, thereby becoming more capable to handle the tasks with substantial distribution shifts that were originally deferred. Theoretical and empirical studies substantiate that MacFRL excels among its GFCL competitors in terms of prediction accuracy and group fair-ness metrics.
more » « less
Free, publicly-accessible full text available April 11, 2026
Metric-Agnostic Continual Learning for Sustainable Group Fairness

Lian, Heng; Zhao, Chen; Chen, Zhong; Thai, My T; Zhu, Xingquan; He, Yi (February 2025, AAAI 2025 Conference)

Free, publicly-accessible full text available February 25, 2026
HGDL: Heterogeneous Graph Label Distribution Learning

Jin, Yufei; Lian, Heng; He, Yi; Zhu, Xingquan (December 2024, Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024. 2024)
Globersons, Amir; Mackey, Lester; Belgrave, Danielle; Fan, Angela; Paquet, Ulrich; Tomczak, Jakub M; Zhang, Cheng (Ed.)
Full Text Available
Utilitarian Online Learning from Open-World Soft Sensing

https://doi.org/10.1109/ICDM59182.2024.00031

Lian, Heng; Huang, Yu; Zhu, Xingquan; He, Yi (December 2024, IEEE International Conference on Data Mining, ICDM 2024, Abu Dhabi, United Arab Emirates, December 9-12, 2024. IEEE 2024, ISBN 979-8-3315-0668-1)

Full Text Available
HGDL: Heterogeneous Graph Label Distribution Learning

Jin, Yufei; Lian, Heng; He, Yi; Zhu, Xingquan (October 2024, The 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024))

Label Distribution Learning (LDL) has been extensively studied in IID data applications such as computer vision, thanks to its more generic setting over single-label and multi-label classification. This paper advances LDL into graph domains and aims to tackle a novel and fundamental heterogeneous graph label distribution learning (HGDL) problem. We argue that the graph heterogeneity reflected on node types, node attributes, and neighborhood structures can impose significant challenges for generalizing LDL onto graphs. To address the challenges, we propose a new learning framework with two key components: 1) proactive graph topology homogenization, and 2) topology and content consistency-aware graph transformer. Specifically, the former learns optimal information aggregation between meta-paths, so that the node heterogeneity can be proactively addressed prior to the succeeding embedding learning; the latter leverages an attention mechanism to learn consistency between meta-path and node attributes, allowing network topology and nodal attributes to be equally emphasized during the label distribution learning. By using KL-divergence and additional constraints, HGDL delivers an end-to-end solution for learning and predicting label distribution for nodes. Both theoretical and empirical studies substantiate the effectiveness of our HGDL approach.
more » « less
Full Text Available
Online Outlier Detection in Open Feature Spaces

https://doi.org/10.1109/TKDE.2025.3593895

Lian, Heng; He, Yi; Wu, Di; Chen, Zhong; Zhu, Xingquan; Wu, Xindong (October 2025, IEEE Transactions on Knowledge and Data Engineering)

Free, publicly-accessible full text available October 1, 2026
Online Learning from Evolving Feature Spaces with Deep Variational Models

https://doi.org/10.1109/TKDE.2023.3326365

Lian, Heng; Wu, Di; Hou, Bo-Jian; Wu, Jian; He, Yi (January 2024, IEEE Transactions on Knowledge and Data Engineering)

In this paper, we explore a novel online learning setting, where the online learners are presented with “doubly-streaming” data. Namely, the data instances constantly streaming in are described by feature spaces that over-time evolve, with new features emerging and old features fading away. The main challenge of this problem lies in the fact that the newly emerging features are described by very few samples, resulting in weak learners that tend to make error predictions. A seemingly plausible idea to overcome the challenge is to establish a relationship between the old and new feature spaces, so that an online learner can leverage the knowledge learned from the old features to better the learning performance on the new features. Unfortunately, this idea does not scale up to high-dimensional feature spaces that entail very complex feature interplay. Specifically. a tradeoff between onlineness, which biases shallow learners, and expressiveness, which requires deep models, is inevitable. Motivated by this, we propose a novel paradigm, named Online Learning Deep models from Data of Double Streams (OLD3S), where a shared latent subspace is discovered to summarize information from the old and new feature spaces, building an intermediate feature mapping relationship. A key trait of OLD3S is to treat the model capacity as a learnable semantics, aiming to yield optimal model depth and parameters jointly in accordance with the complexity and non-linearity of the input data streams in an online fashion. To ablate its efficacy and applicability, two variants of OLD3S are proposed namely, OLD-Linear that learns the relationship by a linear function; and OLD-FD learns that two consecutive feature spaces pre-and-post evolution with fixed deep depth. Besides, instead of re-starting the entire learning process from scratch, OLD3S learns multiple newly emerging feature spaces in a lifelong manner, retaining the knowledge from the learned and vanished feature space to enjoy a jump-start of the new features’ learning process. Both theoretical analysis and empirical studies substantiate the viability and effectiveness of our proposed approach.
more » « less
Full Text Available
A general framework for frequentist model averaging

https://doi.org/10.1007/s11425-018-9403-x

Mitra, Priyam; Lian, Heng; Mitra, Ritwik; Liang, Hua; Xie, Min-ge (February 2019, Science China Mathematics)

Full Text Available
Dimensionality Reduction and Variable Selection in Multivariate Varying-Coefficient Models With a Large Number of Covariates

https://doi.org/10.1080/01621459.2017.1285774

He, Kejun; Lian, Heng; Ma, Shujie; Huang, Jianhua Z. (February 2017, Journal of the American Statistical Association)

Full Text Available

Search for: All records